BOPA: A Bayesian hierarchical model for outlier expression detection

نویسندگان

  • Zhaoping Hong
  • Heng Lian
چکیده

DNA microarray technologies have the capability of simultaneously measuring the abundance of thousands of gene expressions in cells. A common task with microarrays is to determine which genes are differentially expressed under two different biological conditions of interest (e.g. cancerous against non-cancerous cells). It is often the case that there are thousands of genes for a single individual but relatively few individuals in the data set. Additionally, in many cancer studies, a gene may be expressed in some but not all of the disease samples, reflecting the complexity of the underlying disease. Traditional t-tests assume a mean shift for the tumor samples compared to normal samples and is thus not structured to capture partial differential expression. More powerful tests specially designed for this situation are needed to find genes with heterogeneous expressions associated with possible subtypes of the cancer. This thesis proposes a Bayesian model for cancer outlier profile analysis (BOPA). We build on the Gamma-Gamma model introduced in Newton et al. (2001); Kendziorski et al. (2003) and Newton et al. (2004), by using a five-component mixture model to represent various differential expression patterns. The hierarchical mixture model explicitly accounts for outlier expressions and inferences are based on samples from posterior distributions generated from a Markov chain Monte Carlo algorithm. We present simulation and real-life datasets analysis to demonstrate our proposed methodology.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Posterior Predictive Outlier Detection Using Sample Reweighting

In a Bayesian model, we de ne an outlier as an observation which is \surprising" relative to its predictive distribution, under the model, given the remainder of the data. Hence \outlyingness" can be measured by the posterior predictive p-value of any interesting scalar summary of the (possibly multivariate) observation. For this calculation, we exclude the case of interest from the data, analo...

متن کامل

Bayesian change point estimation in Poisson-based control charts

Precise identification of the time when a process has changed enables process engineers to search for a potential special cause more effectively. In this paper, we develop change point estimation methods for a Poisson process in a Bayesian framework. We apply Bayesian hierarchical models to formulate the change point where there exists a step < /div> change, a linear trend and a known multip...

متن کامل

A Bayesian Hierarchical Model for Noise Reduction in Low-field Magnetic Resonance Imaging

In current magnetic resonance imaging (“MRI”) systems, low-field MRI has the advantage of low cost and open. The signal-to-noise ratio (“SNR”) obtained, however, is relatively low. This study thus aims at developing a Bayesian multi-stage hierarchical model with an outlier-detection ability, through the use of a heavy-tailed prior that can be used to reduce the effects of noise introduced. Sinc...

متن کامل

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

Detecting Stage-Wise Outliers in Hierarchical Bayesian Linear Models of Repeated Measures Data

We propose numerical and graphical methods for outlier detection in hierarchical Bayes modeling and analyses of repeated measures regression data from multiple subjects; data from a single subject are generically called a “curve.” The first-stage of our model has curve-specific regression coefficients with possibly autoregressive errors of a prespecified order. The first-stage regression vector...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 56  شماره 

صفحات  -

تاریخ انتشار 2012